Dual Control with Active Learning using Gaussian Process Regression

نویسنده

Tansu Alpcan

چکیده

In many real world problems, control decisions have to be made with limited information. The controller may have no a priori (or even posteriori) data on the nonlinear system, except from a limited number of points that are obtained over time. This is either due to high cost of observation or the highly non-stationary nature of the system. The resulting conflict between information collection (identification, exploration) and control (optimization, exploitation) necessitates an active learning approach for iteratively selecting the control actions which concurrently provide the data points for system identification. This paper presents a dual control approach where the information acquired at each control step is quantified using the entropy measure from information theory and serves as the training input to a state-of-the-art Gaussian process regression (Bayesian learning) method. The explicit quantification of the information obtained from each data point allows for iterative optimization of both identification and control objectives. The approach developed is illustrated with two examples: control of logistic map as a chaotic system and position control of a cart with inverted pendulum.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Nonparametric Disturbance Correction and Nonlinear Dual Control

Automatic control is an important aspect of modern technology, and many devices we use on a daily basis are using automatic control for actuation and decision-making. However, many advanced automatic control methods need a model of the system to control—a mathematical representation of the system’s behavior. These models are not always easy to come by because of the underlying complexity of the...

متن کامل

Gaussian Process Based Dual Latent Function Approach to Ordinal Regression

The Gaussian process prior formulation introduced by us in this paper learns a mapping for ordinal regression task using dual sets of latent functions. In this formulation one set of latent functions are associated with data items and the other set of latent functions are associated with entities. An entity is a term introduced by us in this work to refer to the object responsible for assigning...

متن کامل

Dual Control for Approximate Bayesian Reinforcement Learning

Control of non-episodic, finite-horizon dynamical systems with uncertain dynamics poses a tough and elementary case of the exploration-exploitation trade-off. Bayesian reinforcement learning, reasoning about the effect of actions and future observations, offers a principled solution, but is intractable. We review, then extend an old approximate approach from control theory—where the problem is ...

متن کامل

Extensions of Gaussian Processes for Ranking: Semi-supervised and Active Learning

Unlabelled examples in supervised learning tasks can be optimally exploited using semi-supervised methods and active learning. We focus on ranking learning from pairwise instance preference to discuss these important extensions, semi-supervised learning and active learning, in the probabilistic framework of Gaussian processes. Numerical experiments demonstrate the capacities of these techniques.

متن کامل

Adaptive CSI and feedback estimation in LTE and beyond: a Gaussian process regression approach

The constant increase in wireless handheld devices and the prospect of billions of connected machines has compelled the research community to investigate different technologies which are able to deliver high data rates, lower latency and better reliability and quality of experience to mobile users. One of the problems, usually overlooked by the research community, is that more connected devices...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1105.2211 شماره

صفحات -

تاریخ انتشار 2011

Dual Control with Active Learning using Gaussian Process Regression

نویسنده

چکیده

منابع مشابه

Nonparametric Disturbance Correction and Nonlinear Dual Control

Gaussian Process Based Dual Latent Function Approach to Ordinal Regression

Dual Control for Approximate Bayesian Reinforcement Learning

Extensions of Gaussian Processes for Ranking: Semi-supervised and Active Learning

Adaptive CSI and feedback estimation in LTE and beyond: a Gaussian process regression approach

عنوان ژورنال:

اشتراک گذاری